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STORAGE OF ADVANCED VIDEO CODING (AVC) PARAMETER SETS 

IN AVC FILE FORMAT 

BACKGROUND OF THE INVENTION 

5 

FIELD OF THE INVENTION 

The present invention generally relates to the Advanced Video Coding (AVC) 
file format and, more particularly, to the storage of AVC Parameter Sets in the AVC 
file format. 

10 

BACKGROUND OF THE INVENTION 

The Advanced Video Coding (AVC) file format provides support for both 
streaming media data over a network as well as local playback. The most common 
streaming servers use the Real Time Streaming Protocol (RTSP) along with the 

15 Session Description Protocol (SDP) as a control protocol for the set up and teardown 
of multimedia streaming sessions between clients and servers. Further, the Real 
Time Transport Protocol (RTP) is used as the protocol for data transport for the 
multimedia information between the servers and clients. 

An AVC codec provides a means for decoupling specific information from the 

20 video stream that is relevant for more than one video frame. It is intended that this 
specific information be sent over a reliable transport protocol (such as, e.g., 
Transmission Control Protocol (TCP)), in addition to being sent over an unreliable 
transport protocol (such as, e.g., User Datagram Protocol (UDP)), embedded within 
the multimedia stream. This specific information combined together form a 

25 Parameter Set. An AVC Parameter Set may include information such as, for 
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example: picture or image size; display window size; macro block allocation map; and 
so forth. 

One or many Parameter Sets may exist for any given AVC video stream. The 
encoder and decoder should maintain an identical list of the Parameter Sets. Each 
5 slice header within the AVC video stream includes a code that indicates the 
Parameter Set to be used when decoding. Based on this code, the decoder can then 
determine which Parameter Set to use for proper decoding. Since the Parameter 
Sets are decoupled from the main video stream, it is possible and desirable to send 
them from a multimedia streaming server to a multimedia streaming client in an out- 
10 of-band fashion. 

A description of a typical scenario of a streaming multimedia transaction will 
now be given. When a multimedia streaming client requests streaming multimedia 
from a server, an initial negotiation is typically done prior to the streaming. The initial 
negotiation sequence involves the client sending a request to the server to obtain 
15 information about a particular stream. Afterwards, the server sends a response to the 
client with the minimal necessary details required for receiving and decoding the 
stream. This initial negotiation is commonly done using a network protocol such as 
the Real Time Streaming Protocol (RTSP). The RTSP protocol can be used to send 
the Parameter Sets from the multimedia streaming server to the multimedia 
20 streaming client. However, due to the nature of the streaming server, for the 
streaming server to send the media dependent data to the client, the Parameter Sets 
must be stored in the file in a manner such that the streaming server does not know 
or care what type of multimedia file it is. 
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Accordingly, it would be desirable and highly advantageous to have a method 
for enabling the RTSP protocol of the multimedia streaming server to transport 
Parameter Sets in a media independent manner to the client. 

5 SUMMARY OF THE INVENTION 

The problems stated above, as well as other related problems of the prior art, 
are solved by the present invention, a method for storing AVC Parameter Sets in the 
AVC file format. The present invention provides an extension to the AVC file format 
for the purpose of storing enhanced metadata to improve streaming services. The 
10 extension involves inserting AVC Parameter Sets into the Hint Track metadata of the 
AVC file format while still being able to provide backwards compatibility with existing 
streaming servers. 

According to an aspect of the present invention, there is provided a method for 
employing Parameter Set information corresponding to an Advanced Video Coding 
15 (AVC) file. The method includes the step of embedding the Parameter Set 
information into a Hint Track of the AVC file. 

According to another aspect of the present invention, there is provided an 
apparatus for employing Parameter Set information corresponding to an Advanced 
Video Coding (AVC) file. The apparatus includes an embedding device for 
20 embedding the Parameter Set information into a Hint Track of the AVC file. 

According to yet another aspect of the present invention, there is provided a 
method for employing Parameter Set information corresponding to an Advanced 
Video Coding (AVC) file. The method includes the steps of embedding the 
Parameter Set information into a Hint Track of the AVC file, and transmitting the 
25 Parameter Set information in an out of band and media independent transmission. 
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These and other aspects, features and advantages of the present invention will 
become apparent from the following detailed description of preferred embodiments, 
which is to be read in connection with the accompanying drawings. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a diagram illustrating an Advanced Video Coding (AVC) file 100 to 
which the present invention may be applied, according to an illustrative embodiment 
of the present invention; 

FIG. 2 is a diagram illustrating an Advanced Video Coding (AVC) video 
10 encoder 200 to which the present invention may be applied, according to an 
illustrative embodiment of the present invention; 

FIG. 3 is a diagram illustrating exemplary metadata 300 output from the AVC 
encoder 200 of FIG. 2, according to an illustrative embodiment of the present 
invention; 

15 FIG. 4 is a diagram illustrating Session Description Protocol (SDP) track 

description information 400, according to an illustrative embodiment of the present 
invention; 

FIG. 5 is a high-level block diagram illustrating an apparatus 500 for inserting 
Parameter Sets into the Hint Tracks of an AVC file 599, according to an illustrative 
20 embodiment of the present invention; 

FIG. 6 is a high-level block diagram illustrating the generation of metadata 
tables and the writing of the metadata tables to an AVC file, according to an 
illustrative embodiment of the present invention; 

FIG. 7 is a high-level block diagram illustrating the writing of media data to an 
25 AVC file, according to an illustrative embodiment of the present invention; and 
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FIG. 8 is a high-level block diagram illustrating the writing of a hint track to an 
AVC file, according to an illustrative embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

5 The present invention is directed to an extension to the AVC file format, a 

commonly known streaming digital media storage format, for the purpose of storing 
enhanced metadata to improve streaming services. The extension involves inserting 
AVC Parameter Sets into the Hint Track metadata of the AVC file format while still 
being able to provide backwards compatibility with existing streaming servers. The 
10 Parameter Sets are embedded within the AVC file format Hint Track such that the 
RTSP protocol of the multimedia streaming server is able to transport the Parameter 
Sets in a media independent fashion to the client. 

It is to be appreciated that the present invention generally relates to the 
Advanced Video Coding (AVC) file format (International Organization for 
15 Standardization/International Electrotechnical Commission (ISO/IEC) 14496, Part 
15), jointly developed by the International Telecommunication Union (ITU) and the 
Moving Pictures Experts Group (MPEG). However, given the teachings of the 
present invention provided herein, one of ordinary skill in the related art will 
contemplate this and various other file formats to which the present invention may be 
20 applied while maintaining the spirit and scope of the present invention. 

It is to be understood that the present invention may be implemented in 
various forms of hardware, software, firmware, special purpose processors, or a 
combination thereof. Preferably, the present invention is implemented as a 
combination of hardware and software. Moreover, the software is preferably 
25 implemented as an application program tangibly embodied on a program storage 
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device. The application program may be uploaded to, and executed by, a machine 
comprising any suitable architecture. Preferably, the machine is implemented on a 
computer platform having hardware such as one or more central processing units 
(CPU), a random access memory (RAM), and input/output (I/O) interface(s). The 
5 computer platform also includes an operating system and microinstruction code. The 
various processes and functions described herein may either be part of the 
microinstruction code or part of the application program (or a combination thereof) 
that is executed via the operating system. In addition, various other peripheral 
devices may be connected to the computer platform such as an additional data 
10 storage device and a printing device. 

It is to be further understood that, because some of the constituent system 
components and method steps depicted in the accompanying Figures are preferably 
implemented in software, the actual connections between the system components (or 
the process steps) may differ depending upon the manner in which the present 
15 invention is programmed. Given the teachings herein, one of ordinary skill in the 
related art will be able to contemplate these and similar implementations or 
configurations of the present invention. 

The AVC file format provides support for both streaming media data over a 
network as well as local playback. An AVC file that supports streaming multimedia 
20 includes additional information about the data units to stream. This additional 
information is included in additional tracks of the file known as Hint Tracks. A Hint 
Track provides streaming instructions to a server such as, for example, packet 
formatting information to ensure suitable delivery and timing information to ensure 
proper transmission timing. In additional, a Hint Track may also provide more 
25 information such as details on the type of media that the client is requesting, which 
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then may be sent to the client during an initial interaction between the client and 
server. 

The most common streaming servers use the Real Time Streaming Protocol 
(RTSP) along with the Session Description Protocol (SDP) as a control protocol for 
5 the set up and teardown of multimedia streaming sessions between clients and 
servers. Further, the Real Time Transport Protocol (RTP) is used as the protocol for 
data transport for the multimedia information between the servers and clients. It is to 
be appreciated that the present invention is directed towards streaming servers that 
use RTSP as their control protocol for session establishment. Of course, one of 
10 ordinary skill in the related art will readily appreciate that the present invention may 
also be modified and implemented with respect to streaming servers that use a 
protocol other than RTSP as their control protocol, while maintaining the spirit and 
scope of the present invention. 

RTSP is a text-based protocol where messages include a header and payload. 
15 The header is commonly defined by the standard, whereas portions of the payload 
can be application dependent. In the case of an RTSP based multimedia streaming 
server, the server by default inserts Session Description Protocol (SDP) information 
included within the AVC file into the RTSP message during initial media streaming 
negotiation with a client. Therefore, by embedding the AVC Parameter Sets within 
20 the SDP information, it is possible for the server to transmit the Parameter Sets of the 
media stream to the client without having knowledge of the media format. As a 
result, the Parameter Sets are transmitted prior to any media, and in addition, they 
are transmitted using a reliable transport protocol since RTSP by default uses 
Transmission Control Protocol (TCP). 
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FIG. 1 is a diagram illustrating an Advanced Video Coding (AVC) file 100 to 
which the present invention may be applied, according to an illustrative embodiment 
of the present invention. The AVC file 100 includes AVC video track information 160, 
audio track information 165, video hint track information 170, and audio hint track 
5 information 175. The AVC file 100 further includes audio tracks 110, video tracks 
120, and hint tracks 130. The hint tracks 130 may be used for streaming purposes 
according to the present invention. 

The Hint Tracks 130 within the AVC file 100 include special instructions for a 
streaming server to assist in the formation of packets. This information is provided in 
10 such a way that a server is able to packetize the media data in the manner suitable 
for streaming using a specific network transport. In addition to the inclusion of special 
streaming instructions, other data may be embedded within the Hint Track that the 
server is instructed to send immediately to the client (for example and in accordance 
with the present invention, the other data may include Parameter Sets). 
15 The process of storing AVC parameter Sets within the Hint Track of an AVC 

file will now be described. FIG. 2 is a diagram illustrating an Advanced Video Coding 
(AVC) video encoder 2O0 to which the present invention may be applied, according to 
an illustrative embodiment of the present invention. Initially, the AVC video encoder 
200 will have at least one output interface for delivering the compressed video data 
20 and the corresponding metadata to a second module for storage purposes. Also 
included in FIG. 2 is a transmitter 250 for transmitting the Parameter Sets in an out- 
of-band transmission. The Parameter Sets are transmitted while embedded within 
the SDP payload of an AVC file. 

FIG. 3 is a diagram illustrating exemplary metadata 300 output from the AVC 
25 encoder 200 of FIG. 2, according to an illustrative embodiment of the present 
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invention. The metadata 300 that is delivered from the AVC video encoder 200 may 
include information such as the size 210 of the compressed video frame, the type 215 
of the video frame (e.g., l-frame, P-frame, and B-frame), timing information 220 (e.g., 
frame rate), other attributes 225, and new Parameters Sets 230. Of particular 
5 interest in this application is the Parameter Sets 230, because the Parameter Sets 
230 will be stored in the AVC file within the video stream as well as in the Hint Track 
that references the video stream. Thus, for the Parameter Sets 230 to be stored 
within the Hint Tracks, the data being received on the output interface of the AVC 
video encoder will need to be properly managed. 
10 When Parameter Sets 230 are part of the Metadata 300 output from the AVC 

video encoder, they are copied and stored in a repository for later referencing when 
the Hint Track is created. This allows the Hint Track creation to be more streamlined 
and straightforward because the media track does not need to be parsed beginning 
to end to find the Parameter Sets. 
15 FIG. 4 is a diagram illustrating Session Description Protocol (SDP) track 

description information 400, according to an illustrative embodiment of the present 
invention. The AVC Parameter Sets 230 are stored within the SDP track description 
information 400 included within the Hint Track. Storing the Parameter Sets 230 in the 
Hint Track provides a means for transporting the Parameter Sets 230 using a reliable 
20 network transport during the initialization of a stream session. Equally important, 
backwards compatibility is provided to current streaming servers because an 
additional extension to the file is not required. The SDP track description information 
400 further includes track duration information 410, track description information 420, 
and other attributes 430. Table 1 illustrates the storage of AVC Parameter Sets 
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within Session Description Protocol (SDP) track description information, according to 
an illustrative embodiment of the present invention. 



v=0 

s=AVC Streaming Media Program 

u=http:// 

e=admin@ 

a=control:* 

a=range:npt=0 - 60.96 
m=video 0 RTP/AVP 96 
a=rtpmap:96 AVC/9OO00 

a=paramset:"data:video/jvt;base64,MIME encoded Parameter Sets" 

a=cont rol :trackl D=2 

m=audio 0 RTP/AVP 14 

a=rtpmap:14 M PA/22050 

a=control:tracklD=6 

TABLE 1 

Parameter Set information can be encoded in Multipurpose Internet Mail Extensions 
(MIME) prior to being embedded within the SDP payload of the Hint track. 

FIG. 5 is a high-level block diagram illustrating an apparatus 500 for inserting 
Parameter Sets into the Hint Tracks of an AVC file 599, according to an illustrative 
embodiment of the present invention. The apparatus 500 includes a media and 
metadata manager 510, a metadata parser 520, an insertion module 530, a media 
writing module 540, a metadata writing module 550, a hint track writing module 560, 
and a generic file writing module 570. 

Video data 501 and metadata 502 are received by the media and metadata 
manager 510. The media and metadata manager 510 ensures the file writing 
process is done properly and that the file is not written in an inconsistent manner. In 
most cases, the metadata 502 should reside within the file first, followed by the media 
data tracks and the hint tracks. Therefore, the media and metadata manager 510 is 
present to control the writing process. 
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There are three main writing modules (540, 550, 560) that interface to the 
generic file writing module 570. It may appear that each of these modules are 
sending data to the generic file writing simultaneously, but they are actually sending 
data to be written in a particular order (e.g., the order described above, that being 

5 metadata, media data, and hint data). These writing modules will now be further 
described with respect to FIGs. 6-8. 

FIG. 6 is a high-level block diagram illustrating the generation of metadata 
tables and the writing of the metadata tables to an AVC file, according to an 
illustrative embodiment of the present invention. FIG. 7 is a high-level block diagram 

10 illustrating the writing of media data to an AVC file, according to an illustrative 
embodiment of the present invention. FIG. 8 is a high-level block diagram illustrating 
the writing of a hint track to an AVC file, according to an illustrative embodiment of 
the present invention. As noted above, the writing of an AVC Hint Track to an AVC 
file involves extracting the Parameter Sets from the metadata and storing them in the 

15 Hint Track SDP information. 

Referring to FIGS. 6 and 8, metadata is input to the media and metadata 
manger 510. As is known, metadata is a set of data structures for storing media 
dependent information such as, but not limited to, timestamps, frame sizes, 
parameters sets, and so forth. Metadata is used, for example, to keep the 

20 corresponding media in synchronization with additional data for the purposes of, e.g., 
decoding, encoding, displaying, access, and transmission. 

The media and metadata manager 510 extracts the metadata from the media 
and parses the metadata for Parameter sets (see FIG. 8). Thus, if parameter sets 
are present in the metadata, the metadata parser 520 will extract the Parameter Sets 

25 and feed them into the Hint track's SDP data structure. Otherwise, when a 
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Parameter Set is not found, the metadata parser 520 transforms the metadata into 
standardized data structures that provide a means for writing the information to a file. 
The metadata writing module 550 uses the Application Programming Interface (API) 
of the generic writing module 570 to write the metadata to the AVC file 599. 

5 In FIG. 7, the media and metadata manager 510 extracts the media and the 

total size of the media. The data is then fed directly to the media writing module 540. 
The media writing module also uses the API of the generic writing module 570 to 
write the media samples to the AVC file 599. 

Although the illustrative embodiments have been described herein with 

10 reference to the accompanying drawings, it is to be understood that the present 
invention is not limited to those precise embodiments, and that various other changes 
and modifications may be affected therein by one of ordinary skill in the related art 
without departing from the scope or spirit of the invention. All such changes and 
modifications are intended to be included within the scope of the invention as defined 

15 by the appended claims. 



